Datasets

Numbers of sequences

method Repetition 1 Repetition 2 Repetition 3 Repetition 4 Repetition 5
Method 1 7328 7328 7328 7328 7328
Method 2 41510 41510 41510 41510 41510
Method 3 3582 3582 3582 3582 3582
Method 4 4151 4151 4151 4151 4151
Method 5 8355 8424 8412 8420 8377
Method 6 4151 4151 4151 4151 4151
Method 7 5140 5140 5140 5140 5140
Method 8 4151 4151 4151 4151 4151
Method 9 4151 4151 4151 4151 4151
Method 10 4151 4151 4151 4151 4151
Method 11 2968 2956 2982 2947 3016
Method 12 4151 4151 4151 4151 4151

Sequence length distributions

Amino acid composition

## [[1]]
## NULL
## 
## [[2]]
## NULL
## 
## [[3]]
## NULL
## 
## [[4]]
## NULL
## 
## [[5]]
## NULL
## 
## [[6]]
## NULL
## 
## [[7]]
## NULL
## 
## [[8]]
## NULL
## 
## [[9]]
## NULL
## 
## [[10]]
## NULL
## 
## [[11]]
## NULL
## 
## [[12]]
## NULL

Statistical significance of differences between replicates of each sampling method

Bigram composition

Selected physicochemical properties

prop description
BIGC670101 Residue volume (Bigelow, 1967)
ARGP820101 Hydrophobicity index (Argos et al., 1982)
CHAM820101 Polarizability parameter (Charton-Charton, 1982)
CHOP780201 Normalized frequency of alpha-helix (Chou-Fasman, 1978b)
CHOP780202 Normalized frequency of beta-sheet (Chou-Fasman, 1978b)
CHOP780203 Normalized frequency of beta-turn (Chou-Fasman, 1978b)
FASG760101 Molecular weight (Fasman, 1976)
FASG760104 pK-N (Fasman, 1976)
FASG760105 pK-C (Fasman, 1976)
FAUJ880103 Normalized van der Waals volume (Fauchere et al., 1988)
KLEP840101 Net charge (Klein et al., 1984)
KYTJ820101 Hydropathy index (Kyte-Doolittle, 1982)
ZIMJ680103 Polarity (Zimmerman et al., 1968)
ENGD860101 Hydrophobicity index (Engelman et al., 1986)
FASG890101 Hydrophobicity index (Fasman, 1989)

Physicochemical properties distribution

## [[1]]

## 
## [[2]]

## 
## [[3]]

## 
## [[4]]

## 
## [[5]]

## 
## [[6]]

## 
## [[7]]

## 
## [[8]]

## 
## [[9]]

## 
## [[10]]

## 
## [[11]]

## 
## [[12]]

## 
## [[13]]

## 
## [[14]]

## 
## [[15]]